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CM 

We study the probability distribution F(u) of the maximum of smooth Gaussian fields 
defined on compact subsets of M. d having some geometric regularity. 

Our main result is a general expression for the density of F. Even though this is an 
implicit formula, one can deduce from it explicit bounds for the density, hence for the 
distribution, as well as improved expansions for 1 — F(u) for large values of u. 

The main tool is the Rice formula for the moments of the number of roots of a random 
system of equations over the reals. 

This method enables also to study second order properties of the expected Euler Charac- 
teristic approximation using only elementary arguments and to extend these kind of results 
to some interesting classes of Gaussian fields. We obtain more precise results for the "di- 
, rect method" to compute the distribution of the maximum, using spectral theory of GOE 

random matrices. 



> 



23 



C3 



Abstract 



1 Introduction and notations 



Let X = {X(t) : t G S} be a real-valued random field defined on some parameter set S and 
M := sup teS X(t) its supremum. 

& ■ 

The study of the probability distribution of the random variable M, i.e. the function 
Fm(u) := P{M < u} is a classical problem in probability theory. When the process is Gaussian, 
general inequalities allow to give bounds on 1 — Fm(u) = P{M > u} as well as asymptotic 
results for u — * +oo. A partial account of this well established theory, since the founding paper 
by Landau and Shepp [20] should contain - among a long list of contributors - the works of 
Marcus and Shepp [24] , Sudakov and Tsirelson [30] , Borell [13] [H] , Fernique [17] , Ledoux and 
Talagrand [22], Berman [IT] [12], Adler[2], Talagrand [32] and Ledoux[H]. 

During the last fifteen years, several methods have been introduced with the aim of ob- 
taining more precise results than those arising from the classical theory, at least under certain 
restrictions on the process X, which are interesting from the point of view of the mathematical 
theory as well as in many significant applications. These restrictions include the requirement 
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the domain S to have certain finite-dimensional geometrical structure and the paths of the 
random field to have a certain regularity. 

Some examples of these contributions are the double sum method by Piterbarg [28] ; the 
Euler-Poincare Characteristic (EPC) approximation, Taylor, Takemura and Adler |34j . Adler 
and Taylor [5J; the tube method, Sun [51] and the well- known Rice method, revisited by Azai's 
and Delmas [5], Azai's and Wschebor [6]. See also Rychlik [29] for numerical computations. 

The results in the present paper are based upon Theorem [5J which is an extension of Theorem 
3.1 in Azai's and Wschebor [8] allowing to express the density pm of Fm by means of a general 
formula. Even though this is an exact formula, it is only implicit as an expression for the 
density, since the relevant random variable M appears in the right-hand side. However, it can 
be usefully employed for various purposes. 

First, one can use Theorem [3] to obtain bounds for pm(u) and thus for P{M > u} for 
every u by means of replacing some indicator function in (j3]) by the condition that the normal 
derivative is "extended outward" (see below for the precise meaning). This will be called the 
"direct method". Of course, this may be interesting whenever the expression one obtains can 
be handled, which is the actual situation when the random field has a law which is stationary 
and isotropic. Our method relies on the application of some known results on the spectrum of 
random matrices. 

Second, one can use Theorem[5]to study the asymptotics of P{M > u} as u — > +oo. More 
precisely, one wants to write, whenever it is possible 

1 u 2 

P{M >u}= A{u) exp ( - --,) + B{u) (1) 

where A(u) is a known function having polynomially bounded growth as u — > +oo, a 2 = 
sup tgS Var(A(t)) and B(u) is an error bounded by a centered Gaussian density with variance 
af, o\ < a 2 . We will call the first (respectively the second) term in the right-hand side of (pQ) 
the "first (resp second) order approximation of P{M > u}." 

First order approximation has been considered in [5] [M] by means of the expectation of the 
EPC of the excursion set E u := {t 6 S : X(t) > u}. This works for large values of u. The same 
authors have considered the second order approximation, that is, how fast does the difference 
between P{M > u} and the expected EPC tend to zero when u — ► +oo. 

We will address the same question both for the direct method and the EPC approxima- 
tion method. Our results on the second order approximation only speak about the size of the 
variance of the Gaussian bound. More precise results are only known to the authors in the 
special case where S is a compact interval of the real line, the Gaussian process X is stationary 
and satisfies a certain number of additional requirements (see Piterbarg [28] and Azai's et al. [1]). 

Theorem [5] is our first result in this direction. It gives a rough bound for the error B(u) as 
u — > +oo, in the case the maximum variance is attained at some strict subset of the face in S 
having the largest dimension. We are not aware of the existence of other known results under 
similar conditions. 

In Theorem [6] we consider processes with constant variance. This is close to Theorem 4.3 
in [53]. Notice that Theorem [6] has some interest only in case sup tg £ «t < oo, that is, when 
one can assure that o\ < a 2 in (P). This is the reason for the introduction of the additional 
hypothesis k(S) < oo on the geometry of S, (see below (|64p for the definition of k(S)), which 
is verified in some relevant situations (see the discussion before the statement of Theorem [6]). 

In Theorem [71 S is convex and the process stationary and isotropic. We compute the exact 
asymptotic rate for the second order approximation as u — > +oo corresponding to the direct 
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method. 



In all cases, the second order approximation for the direct method provides an upper bound 
for the one arising from the EPC method. 

Our proofs use almost no differential geometry, except for some elementary notions in Eu- 
clidean space. Let us remark also that we have separated the conditions on the law of the 
process from the conditions on the geometry of the parameter set. 

Third, Theorem and related results in this paper, in fact refer to the density pm of 
the maximum. On integration, they imply immediately a certain number of properties of the 
probability distribution Fm, such as the behaviour of the tail as u — > +00. 

Theorem [3] implies that Fm has a density and we have an implicit expression for it. The 
proof of this fact here appears to be simpler than previous ones (see Aza'is and Wschebor [8]) 
even in the case the process has 1-dimensional parameter (Aza'is and Wschebor [7]). Let us 
remark that Theorem [3] holds true for non-Gaussian processes under appropriate conditions 
allowing to apply Rice formula. 

Our method can be exploited to study higher order differentiability of Fm (as it has been 
done in [7J for one-parameter processes) but we will not pursue this subject here. 

This paper is organized as follows: 

Section 2 includes an extension of Rice Formula which gives an integral expression for the 
expectation of the weighted number of roots of a random system of d equations with d real 
unknowns. A complete proof of this formula in a form which is adapted to our needs in this 
paper, can be found in [9]. There is an extensive literature on Rice formula in various contexts 
(see for example Belayiev [TO J , Cramer-Leadbetter [15], Marcus [23], Adler p], Wschebor [55] . 

In Section 3, we obtain the exact expression for the distribution of the maximum as a conse- 
quence of the Rice-like formula of the previous section. This immediately implies the existence 
of the density and gives the implicit formula for it. The proof avoids unnecessary technicalities 
that we have used in previous work, even in cases that are much simpler than the ones consid- 
ered here. 

In Section 4, we compute (Theorem the first order approximation in the direct method 
for stationary isotropic processes defined on a polyhedron, from which a new upper bound for 
P{M > u} for all real it follows. 

In Section 5, we consider second order approximation, both for the direct method and the 
EPC approximation method. This is the content of Theorems [5j [6] and [71 

Section 6 contains some examples. 
Assumptions and notations 

X = {X(t) : t E S} denotes a real- valued Gaussian field defined on the parameter set S. We 
assume that S satisfies the hypothesis Al 

Al : 

• S is a compact subset of M d 
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• S is the disjoint union of Sj, S^-i-.., So, where Sj is an orientable C 3 manifold of dimension 
j without boundary. The Sj's will be called faces. Let Sd , do < d be the non empty face 
having largest dimension. 

• We will assume that each Sj has an atlas such that the second derivatives of the inverse 
functions of all charts (viewed as diffeomorphisms from an open set in W to Sj) are 
bounded by a fixed constant. For t £ Sj, we denote Lt the maximum curvature of Sj at 
the point t. It follows that L t is bounded for t G S. 

Notice that the decomposition S = Sd U ... U So is not unique. 
Concerning the random field we make the following assumptions A2-A5 

A2 : X is in fact defined on an open set containing S and has C 2 paths 

A3 : for every t G S the distribution of (X(t), X'(t)) does not degenerate; for every s,t G S, 
s j^t, the distribution of (X(s), does not degenerate. 

A4 : Almost surely the maximum of X(t) on S is attained at a single point. 

For t £ Sj, Xj(t) X'- N {t) denote respectively the derivative along Sj and the normal deriva- 
tive. Both quantities are viewed as vectors in M. d , and the density of their distribution will be 
expressed respectively with respect to an orthonormal basis of the tangent space T t j of Sj at 
the point t, or its orthogonal complement N t j. X'-(t) will denote the second derivative of X 
along Sj, at the point t £ Sj and will be viewed as a matrix expressed in an orthogonal basis 
of Tf j. Similar notations will be used for any function defined on Sj. 

A5 : Almost surely, for every j = 0,1, ... ,d there is no point t in Sj such that X'-(t) = 0, 
det(x;(t)) = 

Other notations and conventions will be as follows : 

• Gj is the geometric measure on Sj. 

• m(t) := E(X(t)), r(s,t) = Cov(X(s), X(t)) denote respectively the expectation and co- 
variance of the process X ; ro,i(s,t), ro t 2{s,t) are the first and the second derivatives of r 
with respect to t. Analogous notations will be used for other derivatives without further 
reference. 

• If rj is a random variable taking values in some Euclidean space, p v (x) will denote the 
density of its probability distribution with respect to the Lebesgue measure, whenever it 
exists. 

• (f(x) = (27r) -1 / 2 exp(— x 2 /2) is the standard Gaussian density ; := J^ oQ Lp{y)dy. 

• Assume that the random vectors £, rj have a joint Gaussian distribution, where r\ has 
values in some finite dimensional Euclidean space. When it is well defined, 

e(/(6A? = x) 

is the version of the conditional expectation obtained using Gaussian regression. 

• E u := {t G S : X(t) > u} is the excursion set above u of the function X(.) and A u := 
{M < u} is the event that the maximum is not larger than u. 

• (, ), || ||, denote respectively inner product and norm in a finite-dimensional real Euclidean 
space; is the Lebesgue measure on R d ; S 8 " 1 is the unit sphere ; A c is the complement 
of the set A. If M is a real square matrix, M y denotes that it is positive definite. 
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• If g : D — > C is a function and u G C, we denote 

JV*(D) : = (t{t e £) : 5 ( t ) = n } 
which may be finite or infinite. 

Some remarks on the hypotheses 

One can give simple sufficient additional conditions on the process X so that A4 and A5 hold 
true. 

If we assume that for each pair j, k = 0, . . . , d and each pair of distinct points s,t, s G Sj,t G 
Sk, the distribution of the triplet 

(*(t)-X(a),A$( a ),^(t))) 

does not degenerate in E x W x R k , then A4 holds true. 

This is well-known and follows easily from the next lemma (called Bulinskaya 's lemma) 
that we state without proof, for completeness. 

Lemma 1 Let Z(t) be a stochastic process defined on some neighborhood of a set T embedded 
in some Euclidean space. Assume that the Hausdorff dimension of T is smaller or equal than 
the integer m and that the values of Z lie in M. m+ for some positive integer k . Suppose, in 
addition, that Z has C l paths and that the density Pz(t)( v ) * s bounded for t G T and v in some 
neighborhood of u G M m+fe . Then, a. s. there is no point iGT such that Z(t) = u. 

With respect to A5, one has the following sufficient conditions: Assume Al, A2, A3 and as 
additional hypotheses one of the following two: 

• t ~» X(t) is of class C 3 

sup P(| det (X"{t)) | < 6/X'(t) = x') -► 0, as 5 -c 0, 

teS,x'eV{o) 

where V(0) is some neighborhood of zero. 
Then A5 holds true. This follows from Proposition 2.1 of [8j and |16j . 

2 Rice formula for the number of weighted roots of random 
fields 

In this section we review Rice formula for the expectation of the number of roots of a random 
system of equations. For proofs, see for example [8], or [9], where a simpler one is given. 

Theorem 1 (Rice formula) Let Z : U -^-R d be a random field, U an open subset of M. d and 
u G M. d a fixed point in the codomain. Assume that: 

(i) Z is Gaussian, 

(ii) almost surely the function t ~^ Z{t) is of class C 1 , 

(Hi) for each t G U, Z{t) has a non degenerate distribution (i.e. Var(Z(t)) y 0), 
(iv) P{3t G U, Z(t) = u, det (Z'(t)) = 0} = 

Then, for every Borel set B contained in U , one has 

E (NZ(B)) = [ E (| det(Z'(t))\/Z(t) = u) Pz{t) {u)dt. (2) 

If B is compact, then both sides in p|) are finite. 
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Theorem 2 Let Z be a random field that verifies the hypotheses of Theorem^ Assume that 
for each t G U one has another random field Y t : W — ► M. d , where W is some topological space, 
verifying the following conditions: 

a) Y l (w) is a measurable function of (ui,t,w) and almost surely, (t,w) ~^ Y t (w) is continu- 
ous. 

b) For each t G U the random process (s, w) (Z(s), Y t (w)) defined onU xW is Gaussian. 

Moreover, assume that g : U x C(W,R d ) — ► R is a bounded function, which is continuous when 
one puts on C(W, M d ) the topology of uniform convergence on compact sets. Then, for each 
compact subset I of U , one has 




Remarks: 

1. We have already mentioned in the previous section sufficient conditions implying hypoth- 
esis (iv) in Theorem [TJ 

2. With the hypotheses of Theorem Q] it follows easily that if J is a subset of U, A^(J) = 0, 
then P{N£(J) = 0} = 1 for each u G R d . 

3 The implicit formula for the density of the maximum 

Theorem 3 Under assumptions Al to A5, the distribution of M has the density 

PM {x) = J2 E ( W*(*) = x)Px(t)(x) 
tes 

d „ 

+ J2 / E(|det(X;(t))| l A JX(t) = x,X l j (t) = 0) Pxit)!X , {t) (x,0)a j (dt), (4) 
j= l J Si 

Remark: One can replace | det(X"(t))\ in the conditional expectation by (— l)- 7 det(X"(t)), 
since under the conditioning and whenever M < x holds true, X'-{t) is negative semi-definite. 

Proof of Theorem [3] 

Let Nj(u),j = 0, . . . , d be the number of global maxima of X(.) on S that belong to Sj and are 
larger than u. From the hypotheses it follows that a.s. Ylj=o d^j(u) is equal to or 1, so 
that 

p{m>u}= £ p{jy,-(«) = 1} = Y, e ( n j( u ))- ( 5 ) 

j=o,...,d j=o,...,d 

The proof will be finished as soon as we show that each term in ([5]) is the integral over (u, +oo) 
of the corresponding term in ([4]). 

This is self-evident for j = 0. Let us consider the term j = d. We apply the weighted Rice 
formula of Section [2] as follows : 

• Z is the random field X' defined on Sd- 

• For each t G S d , put W = S and Y t : S R 2 defined as: 

Y\w) := (X(w)-X(t),X(t)). 
Notice that the second coordinate in the definition of Y l does not depend on w. 



6 



• In the place of the function g, we take for each n = 1,2,... the function g n defined as 
follows: 

9n{tiflih) = 9n{h,f2) = (1 - -^n(sup/i(w))).(l - T n (u - f 2 (w))) , 

w&S 

where w is any point in W and for n a positive integer and x > 0, we define : 

F n (x) := F{nx) ; with F{x) = if < x < 1/2 , F{x) = 1 if x > 1 , (6) 
and .F monotone non-decreasing and continuous. 

It is easy to check that all the requirements in Theorem [2] are satisfied, so that, for the value 
instead of u in formula Q we get: 

E( Yl 9n(Y t ))= f E(\det(X"(t)\g n (Y t )/X , (t) = 0).p x , (t) (0)\ d (dt). (7) 
tes d ,x'(t)=o JSd 

Notice that the formula holds true for each compact subset of Sd in the place of S d , hence for 
Sd itself by monotone convergence. 

Let now n ->• oo in (|2J). Clearly g^Y*) [ lx(s)-X(t)<o,Vses- ^x(t)>u- The passage to the limit 
does not present any difficulty since < g n (Y t ) < 1 and the sum in the left-hand side is bounded 
by the random variable (Sd), which is in L l because of Rice Formula. We get 

E(N d (u)) = [ E(|det(X"(i)| l X (s)-x(t)<oyses l X (t)>jX'(t) = 0).p x/{t) (0)X d (dt) 
Js d 

Conditioning on the value of X(t), we obtain the desired formula for j = d. 

The proof for 1 < j < d — 1 is essentially the same, but one must take care of the parame- 
terization of the manifold Sj . One can first establish locally the formula on a chart of Sj , using 
local coordinates. 

It can be proved as in [8], Proposition 2.2 (the only modification is due to the term 1a x ) 
that the quantity written in some chart as 

E(det(y"(s)) 1aJY(s) = x,Y'(s) = 0)p Y{s) ^ {s) (x,0)ds, 
where the process Y(s) is the process X written in some chart of Sj , 

(y(s) = X (4>~ l (s))) , defines a j-form. By a j-form we mean a mesure on Sj that does not 
depend on the parameterization and which has a density with respect to the Lebesgue measure 
ds in every chart. It can be proved also that the integral of this j-form on Sj gives the 
expectation of Nj(u). 

To get formula ([2]) it suffices to consider locally around a precise point t £ Sj the chart (j) 
given by the projection on the tangent space at t. In this case we obtain that at t 

• ds is in fact aj(dt) 

• y'(s) is isometric to X'j(t) 

where s = 4>(t). □ 

The first consequence of Theorem [3] is the next corollary. For the statement, we need to 
introduce some further notations. 

For t in Sj, j < do we define Ctj as the closed convex cone generated by the set of directions: 

{A G R d : ||A|| = 1 ;3 s n G S, (n = 1, 2, . . .) such that s n -> t, j ^ -> A as n +oo}, 

\\t Sn || 
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whenever this set is non-empty and Cfj = {0} if it is empty. We will denote by Ct.j the dual 
cone of Ctj, that is: 

C t j := {z G R d : (z, A) > for all A G C tJ }. 

Notice that these definitions easily imply that Ttj C Ctj and Ctj C Ntj- Remark also that for 
j = do, C t j = N t j. 

We will say that the function X(.) has an "extended outward" derivative at the point t in 
SjJKdo ifX' jjN (t)eC ttj . 

Corollary 1 Under assumptions Al to A5, one has : 

(a) pm{x) < p{x) where 

p(x) := J>( l x , md jX(t)=x)p m (x)+ 
teS 
do a 

E(|det(x;(t))| l x , )e£ /X(t)=x,X' j (t)=0) Px{tlxl(t) (x,0)a j (dt). (8) 

J I 7 • 

r+oo 

(b) P{M >u} < p{x)dx. 

J u 

Proof 

(a) follows from Theorem [3] and the observation that if t G Sj , one has 

{M < X(t)} C {X'j N (t) G Ctj}. (b) is an obvious consequence of (a). □ 

The actual interest of this Corollary depends on the feasibility of computing p(x). It turns 
out that it can be done in some relevant cases, as we will see in the remaining of this section. 
Our result can be compared with the approximation of P{M > u} by means of f^°°p E (x)dx 
given by [3], [M] where 

p E {x) := Y, E( l x , me jX(t) = x)p m {x) 
teSb 

dp . 

+ ^(-l) j / E(det(X/(i)) l x , (t)eS /X(t) = x J ^(t) = 0)p m ^ (t) (x,0)(7 i (rft). (9) 

Under certain conditions , p E {x)dx is the expected value of the EPC of the excursion set 
E u (see [3]). The advantage of p E (x) over p(x) is that one can have nice expressions for it in 
quite general situations. Conversely p(x) has the obvious advantage that it is an upper-bound 
of the true density pm(x) and hence provides upon integrating once, an upper-bound for the 
tail probability, for every u value. It is not known whether a similar inequality holds true for 
p E (x). 

On the other hand, under additional conditions, both provide good first order approximations 
for pm{x) as x — > oo as we will see in the next section. In the special case in which the process 
X is centered and has a law that is invariant under isometries and translations, we describe 
below a procedure to compute p{x). 
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4 Computing p(x) for stationary isotropic Gaussian fields 



For one-parameter centered Gaussian process having constant variance and satisfying certain 
regularity conditions, a general bound for pm(%) has been computed in [8], pp. 75-77. In the 
two parameter case, Mercadier [26] has shown a bound for P{M > u}, obtained by means of a 
method especially suited to dimension 2. When the parameter is one or two-dimensional, these 
bounds are sharper than the ones below which, on the other hand, apply to any dimension but 
to a more restricted context. We will assume now that the process X is centered Gaussian, 
with a covariance function that can be written as 

E(X( S ).X(t))=p(\\s-t\\ 2 ), (10) 

where p : R + — > R is of class C 4 . Without loss of generality, we assume that p(0) = 1. 
Assumption (jlO)) is equivalent to saying that the law of X is invariant under isometries (i.e. 
linear transformations that preserve the scalar product) and translations of the underlying 
parameter space R rf . 

We will also assume that the set S is a polyhedron. More precisely we assume that each 
Sj(j = 1, . . . , d) is a union of subsets of affine manifolds of dimension j in W 1 . 

The next lemma contains some auxiliary computations which are elementary and left to the 
reader. We use the abridged notation : p' := p'(0), p" := p"(0) 

Lemma 2 Under the conditions above, for each t £ U, i, i', k, k', j = 1, . . . , d: 

1. E(f (t).X(t))=0, 

2. E(§£(*).g£(t)) = -2p'5 ik and p' < 0, 

3- E(S( t )^))= 2 A,E(||(t).|(i))=0 

^- E ( dhdtk ■ W, dt y (0) = 24p // [^/.5/ ! fc/ + Si'k-Sik' + fe^'A;']) 
5. p" - p' 2 > 

d?. If t £ Sj, the conditional distribution of X'-(t) given X(t) = x,Xj(t) = is the same as 
the unconditional distribution of the random matrix 



Z + 2p'xlj , 

where Z = {Zq. : i,k = is a symmetric j x j matrix with centered Gaussian 

entries, independent of the pair (X(t),X'(t)) such that, fori < k, i' < k' one has : 

E(Z ik Zi> k >) = 4[2p"Su> + (p" - p' 2 )]5ikO~i>k' + 4p"5u>.5kk>(l - $ik) ■ 

Let us introduce some additional notations: 

• H n (x),n = 0, 1, . . . are the standard Hermite polynomials, i.e. 

For the properties of the Hermite polynomials we refer to Mehta [25] . 

• H n (x),n = 0, 1, . . . are the modified Hermite polynomials, defined as: 

H n (x) :=^l\-^-Je-* 2 l* 
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We will use the following result: 
Lemma 3 Let 

/+oo 
e- y2 / 2 H n (v)dy, 71 = 0,1,2,... (11) 
-oo 

where v stands for the linear form v = ay + bx where a, b are some real parameters that satisfy 
a 2 + b 2 = 1/2. Then _ 

J n {x) := (2b) n V2^H n (x). 

Proof : 

It is clear that J n is a polynomial having degree n. Differentiating in (jlip under the integral 
sign, we get: 

/+oo r+oo 
e- y2/2 H' n {v)dy = 2nb / e~^ /2 H n -i(i/)dy = 2n b J„_i(x) (12) 
-oo J —oo 

Also: 

Jn(0)= / e- y2 / 2 H n (ay)dy, 



so that J n (0) = if n is odd. 

If n is even, n > 2, using the standard recurrence relations for Hermite polynomials, we have: 

/+oo 
e~ y2 / 2 [2ayH n ^ (ay) - 2(n - l)H n _ 2 {ay)]dy 
-oo 

e-» l 2 R' n _ x {ay)dy - 2(n - l)J„- 2 (0) 

-oo 

= -46 2 (n-l)J n _ 2 (0). (13) 
Equality (fT3|) plus Jo (x) = -v/^tt for all i£R, imply that: 

hM = (-\y(2bfv(2 v - l)!!v^F = (-26 2 f Ml v^F. (14) 

p! 

Now we can go back to (|12p and integrate successively for n = 1,2, .. . on the interval [0,x] 
using the initial value given by (|14p when n = 2p and J n (0) = when n is odd, obtaining : 

J n (x) = (2b) n V2^Q n (x), 

where the sequence of polynomials Q n , n = 0, 1, 2, . . . verifies the conditions: 

Qo(x) = 1 (15) 

Q' n {x) = nQ n {x) (16) 

Q n (0) =0 if n is odd (17) 

Q n (0) = (-l) n / 2 (n - 1)!! if n is even. (18) 

It is now easy to show that in fact Q n (x) = H n (x) , n = 0, 1, 2, . . . using for example that: 

H n {x) = 2 n ' 2 H n {^=). 

□ 

The integrals 



r+oo 

I n (v) = / e- t2 ' 2 H n {t)dt, 

J V 



will appear in our computations. They are computed in the next Lemma, which can be proved 
easily, using the standard properties of Hermite polynomials. 
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Lemma 4 (a) 



m-i i 



J B («) = 2e- 2 / 2 £ 2 * ( _" _^ Vi- a ( P ) (19) 
fc=o 1 ; " 
+ I{„ even} 2* (n - l)Hv^F *(x) (20) 



/ n (-oo) = I {n even}25 (n - 1)!! V^F (21) 



Theorem 4 Assume that the process X is centered Gaussian, satisfies conditions A1-A5 with 
a covariance having the form (10\) and verifying the regularity conditions of the beginning of this 
section. Moreover, let S be a polyhedron. Then, p(x) can be expressed by means of the following 
formula: 

Y,Mt)+i^[(}^) j,2 H j {x)+R j {x)\g j \ , (22) 



p(x) = tp(x) < 

where 



IT 

teS j=i 



gj is a geometric parameter of the face Sj defined by 

9j = [ d (t)o- (dt), (23) 



where dj{t) is the normalized solid angle of the cone Ctj in N t j, that is: 

— / 1 \ vd-j-i{Ct,j n s _ . . . 

0j(*) = (cd-i-u forj=0,...,d-l, (24) 

a d (t) = 1. (25) 

Notice that for convex or other usual polyhedra &j(t) is constant for t E Sj, so that gj is 
equal to this constant multiplied by the j -dimensional geometric measure of Sj. 



For j = l,...d, 

RAx) = i 



2p" ^ r((j+l)/2 [+°° y\ 



where 
and 



v ---(2)- 1 l 2 ((l- 1 2 ) 1 l 2 y- 1 x) with 7 := \p'\(p'T 1/2 (27) 



3 1 Hi(v) 1 _2 /0 HAv) 



k=0 KJ ' 

where I n is given in the previous Lemma. 

For the proof of the theorem, we need some ingredients from random matrices theory. 
Following Mehta [25J, denote by q n {y) the density of eigenvalues ofnxn GOE matrices at the 
point v, that is, q n {y)dv is the probability of G n having an eigenvalue in the interval (y, u + dv). 
The random nxn real random matrix G n is said to have the GOE distribution, if it is symmetric, 
with centered Gaussian entries gik,i, k = 1, . . . ,n satisfying E(<j£) = 1, ~E(g 2 k ) = 1/2 if i < ft 
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and the random variables: {gik, 1 < i < k < n} are independent. 
It is well known that: 



n-1 

,2 , 



k=0 

f+co r+oo 

+ l/2{n/2) 1 l 2 c n - X c n H nr . 1 {u) / e~ y2 ' 2 H n (y)dy - 2 / ' 2 H n {y)dy 



Ki<k<n i=l 



where c k := (2 fc /fc!0F)- 1 /2 ; A; = 0, 1, . . ., (see Mehta [25], ch. 7.) 

In the proof of the theorem we will use the following remark due to Fyodorov [18] that we 
state as a Lemma 

Lemma 5 Let G n be a GOE n x n matrix. Then, for ygl one has: 

E(| det(G n - vl n )\) = 2 3 / 2 r((n + 3)/2) exp(^/2)^^, (30) 

Proof: 

Denote by v\, . . . ,u n the eigenvalues of G n . It is well-known (Mehta [25], Kendall et al. [19J) 
that the joint density f n of the n-tuple of random variables (fx, . . . , u n ) is given by the formula 

f n (v u ...,v n ) = c n exp(-^f^) J] \ Vk -ut\ , withc n :=(27r)-™/ 2 (r(3/2)r(nr(l+i/2))^ 
Then, 

n 

E(\det(G n -uI n )\) =E(Y[\vi-v\) 

i=l 

= IT - ^|c n exp( 1=1 % ) \\ \v k -Vi\ dvx,...,du n 

/mm Z 

JK 1=1 l<i<k<n 

u 2 /2 C n f , / s, , u 2 /2 °n Qn+l(v) 

= e ' / f n+ i{ux,...,v n ,u)dux,...,dv n = e ' — — . 

c n+l JR™ c n+l n+ L 

The remainder is plain. □ 
Proof of Theorem |4) 

We use the definition (|8|) given in Corollary Q] and the moment computations of Lemma [2] which 
imply that: 

Px(t)(x) = <p(x) (31) 
Px(t),x>(t)(x,0) = v{x){2K)-^ 2 {-2p')-j/ 2 (32) 

X'(t) is independent of X(t) (33) 
X' j)N (t) is independent of (X'j (t) , X (t) , X'j (t)) . (34) 

Since the distribution of X'(t) is centered Gaussian with variance —2p'Id, it follows that : 

E ( 1 x'(t)ecJ X ( t ) = x )= Mt) if * e So, 



12 



and if t G Sj , j > 1 : 

E(| det(X;(t))| l x , At)eS jX(t) = x,Xj(t) = 0) 

= %(t) E(|det(Xj'(t))|/A:(t) =x,Xj(t) = 0) 

= dj(t) E(| det(Z + 2 /9 / x/ j )|). (35) 

In the formula above, &j (t) is the normalized solid angle defined in the statement of the theorem 
and the random j x j real matrix Z has the distribution of Lemma [2] . 

A standard moment computations shows that Z has the same distribution as the random matrix: 

where Gj is a j x j GOE random matrix, £ is standard normal in R and independent of Gj. 
So, for j > 1 one has 

/+oo 
E(|det(G i -i/J i )|)v(y)dy, 
-oo 

where ^ is given by (|27p . 

For the conditional expectation in ([8]) use this last expression in ([35]) and ©. For the density 
in (|a|) use ®. Then Lemma [3] gives §22$. □ 



Remarks on the theorem 

• The "principal term" is 



tp(x) < 



do I /i 

te5 i=i 



which is the product of a standard Gaussian density times a polynomial with degree do- 
Integrating once, we get -in our special case- the formula for the expectation of the EPC 
of the excursion set as given by [3] 

The "complementary term" given by 

d 

(p(x)^2R j (x)g j , (37) 

3=1 

can be computed by means of a formula, as it follows from the statement of the theorem 
above. These formulae will be in general quite unpleasant due to the complicated form of 
Tj(v). However, for low dimensions they are simple. For example: 

T x {v) = V2^[ip(v) - v(l -*(«))], (38) 
T 2 {v) = 2V2^<p(v), (39) 

T 3 (v) = ^[3(2^ + lMt;) " (2t> 2 - 3)u(l - $(«))] . (40) 
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Second order asymptotics for pm( x ) as x — > +00 will be mainly considered in the next 
section. However, we state already that the complementary term (|37p is equivalent, as 
x — ► +00, to 

<p(x) g d0 K d0 x 2d «- A e~^ x \ (41) 
where the constant Kj, j = 1, 2, ... is given by: 

K] ~ Z 0F(27r 7 )i/2(i-l)! P h-^ ■ (42) 

We are not going to go through this calculation, which is elementary but requires some 
work. An outline of it is the following. Replace the Hermite polynomials in the expression 
for Tj{v) given by (|28p by the well-known expansion: 

t- 7 '/ 2 ! (n \j-2i 

»» = g^'jfe («) 

and Ij-i(v) by means of the formula in Lemma HI 

Evaluating the term of highest degree in the polynomial part, this allows to prove that, 
as v — > +00, Tj(v) is equivalent to 

03— 1 2 

^'"VHr. (44) 



Using now the definition of i?j (x) and changing variables in the integral in (126j) , one gets 
for Rj(x) the equivalent: 



1 !_„2 



KjX**-* e . (45) 



In particular, the equivalent of (|37p is given by the highest order non-vanishing term in 
the sum. 

Consider now the case in which S is the sphere and the process satisfies the same 

conditions as in the theorem. Even though the theorem can not be applied directly, 
it is possible to deal with this example to compute p(x), only performing some minor 
changes. In this case, only the term that corresponds to j = d — 1 in (JSj) does not vanish, 
C* d-i = N* d-l > so that l v , u\^r> = 1 for each t £ S d_1 and one can use invariance 
under rotations to obtain: 



p(x) = <p(x) ^[f^ Efl det(Z + 2/^-i) + (2|p'|) 1 / 2 r ? I^ 1 |) (46) 

where Z is a (d — 1) X (d — 1) centered Gaussian matrix with the covariance structure 
of Lemma [2] and rj is a standard Gaussian real random variable, independent of Z. (|46j) 
follows from the fact that the normal derivative at each point is centered Gaussian with 
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variance 2\p'\ and independent of the tangential derivative. So, we apply the previous 
computation, replacing x by x + (2|p'|) -1 / 2 rj and obtain the expression: 

2vr d / 2 

[ +0 ° [i}Af-^H d _ l{x + (2 |p'|)- l/2 y) + Rd _ l{x + (2| P '|)- 1 /2y)] ( 47 ) 
J —oo 7I " 

5 Asymptotics as n +00 

In this section we will consider the errors in the direct and the EPC methods for large values 
of the argument x. Theses errors are: 

p(x)-p M (x) = E ( 1 X'(t)eC tfi - 1 M>x/X{t) = x)p x(t) (x) 
tes 

do p 

+ ^2 E (l det(Xj'(t)| l x , Nm£t .. 1 M>X )/X(t) = XtXfi) = 0) Px{t)>x/jit) (x,0)aj(dt). (48) 
3=1 JS i '' 

P E (x) - P m(x) = ^ E ( 1 x'(t)ec t , - t M>x/X(t) = x)p X (t)(x) 
tes 

dp p 

+ / E(det(X;(t) t x , At)eStf Ui>,)/X{t) = x,X' s {t) = 0)p x{tim (x,0)a 3 (dt). 

j=l J Sj J ' 

(49) 

It is clear that for every real x, 

\p E {x) -p M {x)\ <p{x) -pu{x) 

so that the upper bounds for p{x) — pm(x) will automatically be upper bounds for 
\p E (x) — Pm(x)\- Moreover, as far as the authors know, no better bounds for \p E (x) — Pm{x)\ 
than for p(x) — pm(x) are known. It is an open question to determine if there exist situations 
in which p E {x) is better asymptotically than p{x). 

Our next theorem gives sufficient conditions allowing to ensure that the error 

p(x) -p M {x) 

is bounded by a Gaussian density having strictly smaller variance than the maximum variance 
of the given process X , which means that the error is super- exponentially smaller than pm(x) 
itself, as x — > +00. In this theorem, we assume that the maximum of the variance is not attained 
in S\Sd - This excludes constant variance or some other stationary-like condition that will be 
addressed in Theorem [6j As far as the authors know, the result of Theorem [5] is new even for 
one-parameter processes defined on a compact interval. 

For parameter dimension do > 1, the only result of this type for non-constant variance 
processes of which the authors are aware is Theorem 3.3 of [34j . 

Theorem 5 Assume that the process X satisfies conditions Al -A5. With no loss of generality, 
we assume that maxtgg Y&r(X(t)) = 1. In addition, we will assume that the set S v of points 
t £ S where the variance of X(t) attains its maximal value is contained in Sd (do > 0) the 
non-empty face having largest dimension and that no point in S v is a boundary point of S\Sd - 
Then, there exist some positive constants C, 5 such that for every x > 0. 

\p E (x) - p M {x)\ < p(x) - p M {x) < C<p(x(l + 5)), (50) 

where (p(.) is the standard normal density. 
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Proof : 



Let W be an open neighborhood of the compact subset S v of S such that distiW, (S\S do )) > 
where dist denote the Euclidean distance in W d . For t G Sj n W c , the density 

Px(t),xj(t)0»,O) 

can be written as the product of the density of X'- (t) at the point 0, times the conditional density 
of X(t) at the point x given that X'j(t) = 0, which is Gaussian with some bounded expectation 
and a conditional variance which is smaller than the unconditional variance, hence, bounded by 
some constant smaller than 1. Since the conditional expectations in (|48p are uniformly bounded 
by some constant, due to standard bounds on the moments of the Gaussian law, one can deduce 
that: 

p(x)- PM (x)= [ E(|det(X£(i))| l x , 6 . l M>x /X(t) = x,X' do (t) = 0) 
Jwns do d o> Ny ' Ud ° 

■Px(t),x' do (t){x,0)a (h (dt) + 0(<p((l + h)x)), (51) 

as x — ► +oo, for some 5\ > 0. Our following task is to choose W such that one can assure 
that the first term in the right hand- member of (|5ip has the same form as the second, with a 
possibly different constant 5\. 

To do this , for s G S and t G Sd , let us write the Gaussian regression formula of X(s) on the 
pair (X(t),X' do (t)): 

X(s) = a\s)X(t) + + ^^X\s). (52) 

where the regression coefficients a t (s),b t (s) are respectively real- valued and R rf ° -valued. 
From now onwards, we will only be interested in those t G W. In this case, since W does not 
contain boundary points of S\Sd , it follows that 

CU = AU ^d l^ o w(t)e c Mo = L 

Moreover, whenever s G S" is close enough to i, necessarily, s G and one can show that 
the Gaussian process {X l (s) : t G W D S^jS G 5} is bounded, in spite of the fact that its 
trajectories are not continuous at s = t. For each t, {X t (s) : s G S} is a "helix process", see [8] 
for a proof of boundedness. 

On the other hand, conditionally on X{t) = x, X' do (t) = the event {M > x} can be written as 

{X*(s) > /3*(s) x, for some s G S} 

where 

= (53) 

||r — s\\ z 

Our next goal is to prove that if one can choose W in such a way that 

inf{/3'(s) : t G W n 5 do , s G 5, s / t} > 0, (54) 

then we are done. In fact, apply the Cauchy-Schwarz inequality to the conditional expectation 
in (|51|) . Under the conditioning, the elements of X' do (t) are the sum of affine functions of x 
with bounded coefficients plus centered Gaussian variables with bounded variances, hence, the 
absolute value of the conditional expectation is bounded by an expression of the form 

(Q(t,*)) 1/2 (P( sup ^>*)) 1/2 , (55) 
V s€S \m Pys) J 
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where Q(t, x) is a polynomial in x of degree 2do with bounded coefficients. For each t £ W(lSd , 
the second factor in ([55]) is bounded by 

/ / X t (s) \\ 1/2 

/p(su P {-^ :tewns d0 ,ses,s^t} >x)\ . 

Now, we apply to the bounded separable Gaussian process 

rX t (s) 1 

[j^-j:teWnS do ,seS,s^tj 

the classical Landau-Shepp-Fernique inequality [20], [T7j which gives the bound 
P(sup{|^ :t€Wr\S do ,8€S,s^t}>x) < C 2 exp(-5 2 x 2 ), 

for some positive constants C 2 , S 2 and any x > 0. Also, the same argument above for the density 
Px(t),x' d (t)( x )0) shows that it is bounded by a constant times the standard Gaussian density. 
To finish, it suffices to replace these bounds in the first term at the right-hand side of (|5ip . 



It remains to choose W for (|54p to hold true. Consider the auxiliary process 

Y(s) := X(S) , s e S. (56) 
yV(s,s) 

Clearly, Var(Y(s)) = 1 for all s 6 S. We set 

r y (s, a') := Cov(Y(s), Y{s')) , s, *' € 5. 

Let us assume that t E S v . Since the function s ~» Var(X(s)) attains its maximum value at 
s = t, it follows that X(f),A^ u (i) are independent, on differentiation under the expectation 
sign. This implies that in the regression formula (|52p the coefficients are easily computed and 
a*(s) = r(s, t) which is strictly smaller than 1 if s ^ t, because of the non-degeneracy condition. 
Then 

2( 1 -r(s,t)) > 2(1 -r Y (s ,t)) 
It -sll 2 ~ \\t-s\ 



P\s) = ,u % " > || ' r 7 • (57) 



Since r Y (s,s) = 1 for every s 6 S, the Taylor expansion of r Y (s,t) as a function of s, around 
s = t takes the form: 

r Y (s, t) = l + (s-t, r Y do (t, t)(s - t)) + o(\\s - t\\ 2 ), (58) 

where the notation is self-explanatory. 

Also, using that Var(Y(s)) = 1 for s G S, we easily obtain: 

" ^.(M) = Var(Fj (t)) = Var(X^(i)) (59) 

where the last equality follows by differentiation in (|56p and putting s = t. (|59[) implies that 
~ r 20d ^ s uniformly positive definite on t G Sy, meaning that its minimum eigenvalue has 
a strictly positive lower bound. This, on account of (I57D and (j58H . already shows that 

inf{/3*(s) : t E S„, s E 5, s / i} > 0, (60) 

The foregoing argument also shows that 

mf {-T(a%(t)T :teS v ,re S*>"W *} > 0, (61) 



17 



since whenever t G S v , one has a t (s) = r(s,t) so that 

(a*)d ( t ) =r 20 ) do, (*,*)• 

To end up, assume there is no neighborhood W of S v satisfying (|54p . In that case using a 
compactness argument, one can find two convergent sequences {s n } C S , {i n } C Sd , s n — ► so, 
t n ^ to £ S v such that 

/? in (Sn) ->*<(). 

£ may be — oo. 

to 7^ so is not possible, since it would imply 

11*0 - «olr 

which is strictly positive. 

If to = so, on differentiating in ([52]) with respect to s along we get: 

f), 11/ _ <j||2 

*koo = («%,(«)*(*) + ((^dbW.^W) + it-^^x 1 ^ 

where (a 4 )^ (s) is a column vector of size do and (b*)d ( s ) is a x matrix. Then, one must 
have a*(i) = 1, (a*)' do (t) = . Thus 

/?*«(s n ) =-t^(o*°4(t K + o(l), 

where u n := (s n — t n )/\\s n — t n \\. Since to £ $u we may apply (JBTj) and the limit ^ of f3 tn (s n ) 
cannot be non-positive. □ 

A straightforward application of Theorem [5] is the following 

Corollary 2 Under the hypotheses of Theorem^ there exists positive constants C,S such that, 
for every u > : 



< 



r+oo r+oo 

j p E {x)dx - P(M >u) < p{x)dx - P(M > it) < CP(£ > u), 

J u J u 



where ^ is a centered Gaussian variable with variance 1 — 5 

The precise order of approximation of p(x) — Pm(x) or p E (x) — pu{x) as x — > +oo remains 
in general an open problem, even if one only asks for the constants a^, o 2 E respectively which 
govern the second order asymptotic approximation and which are defined by means of 

-2 := lim -2x~ 2 log \p(x) - p M (x)] (62) 

and 

— 5- := lim — 2x~ 2 log — pm{%) I (63) 

whenever these limits exist. In general, we are unable to compute the limits (1621) or (j63|) or 
even to prove that they actually exist or differ. Our more general results (as well as in |3j, |34J) 
only contain lower-bounds for the liminf as x — ► +oo. This is already interesting since it gives 
some upper-bounds for the speed of approximation for pm(x) either by p{x) or p E (x). On the 
other hand, in Theorem [7] below, we are able to prove the existence of the limit and compute 
a\ for a relevant class of Gaussian processes. 
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For the next theorem we need an additional condition on the parameter set S. For S 
verifying Al we define 



fa\ dist((t - s),C t j) 
K{b) = sup sup sup li (64) 

0<j<d teSj s£S,Sytt \\s — t\\ 

where dist is the Euclidean distance in W*. 

One can show that n{S) < oo in each one of the following classes of parameter sets S: 

- S is convex, in which case k(S) = 0. 

- S is a C 3 manifold, with or without boundary. 

- S verifies the following condition: For every t G S there exists an open neighborhood V of 
t in M. d and a C 3 diffeomorphism ifi : V — > B(0,r) (where B(0,r) denotes the open ball in M. d 
centered at and having radius r, r > 0) such that 

ip(V n S) = C n -B(0, r), where C is a convex cone. 

However, k(S) < oo can fail in general. A simple example showing what is going on is the 
following: take an orthonormal basis of R 2 and put 

S = {(A,0) : < A < 1} U {0 cos 9,n sin 9) : < fi < 1} 

where < 9 < tt, that is, S is the boundary of an angle of size 9. One easily checks that 
n(S) = +oo. Moreover it is known [3] that in this case the EPC approximation does not verify 
a super- exponential inequality. More generally, sets S having "whiskers" have k(S) = +oo. 



Theorem 6 Let X be a stochastic process on S satisfying Al -A5. Suppose in addition that 

VarpT(t)) = 1 for all t £ S and that k(S) < +oo. 

Then 

liminf — 2x~ 2 log \p(x) — Pm(x)] > 1 + inf — — = (65) 



with 



and 



where 



a t := sup 



tes ^ + \{t)K 2 t 
Vax(X(a)/X(t),X'(t)) 



sGS\{t} (l-r(a,t)) 



2 



dist( - A t 1 r i(s,t),C t j 
K t := sup — — t -, (66) 



s&S\{t} 



l-r(sA) 



• A t := Var(X'(i)) 

• A(t) is the maximum eigenvalue of At 

• in [66\) . j is such that t € Sj ,(j = 0, 1, . . . , do). 

The quantity in the right hand side of I165\) is strictly bigger than 1. 

Remark. In formula (|65p it may happen that the denominator in the right-hand side is 
identically zero, in which case we put +oo for the infimum. This is the case of the one-parameter 
process X(t) = £ cost + rjsint where £,77 are Gaussian standard independent random variables, 
and S is an interval having length strictly smaller than tt. 
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Proof of Theorem [B] 

Let us first prove that sup 4gS n t < oo. 

For each t £ S, let us write the Taylor expansions 

roiM) = r 01 {t,t)+r u {t,t)(s -t) + 0(\\s - t\\ 2 ) 
= A t ( S -t)+0(\\s-t\\ 2 ) 

where O is uniform on s, t £ S, and 

1 -r(s,t) = (s - t) T A t (s - t) + 0{\\s - t\\ 2 ) > L 2 ||s-t|| 2 , 

where L 2 is some positive constant. It follows that for s £ S, t £ Sj, s 7^ t, one has: 

^(-A^roiM),^) dist{(t- S ),C t ,j) T 

i^K^j 3 — ¥^W — + 4 ' (67) 

where L3 and L4 are positive constants. So, 



c&stf - A t 1 r i(s,t),Ct 1 j) 



< L 3 k(S) + L 4 . 



1 - r(s,i) 
which implies sup tg 5 n t < 00. 

With the same notations as in the proof of Theorem [SJ using @ and ([S]), one has: 



p(x) -pm{x) = <p(x) 



E E ( I ^( i)6 c t , - Ui> x /X(t)=x) 
teS 

+ £ / H\det(X»(t))\ l x , {t)e c tJ . l M jm = x,X>(t) = 0) 

(2^)- J '/ 2 (det(Var(X'(t))))- 1 /V j (^) 



i=i " ^ 



Proceeding in a similar way to that of the proof of Theorem [5j an application of the Holder 
inequality to the conditional expectation in each term in the right-hand side of (|68p shows that 
the desired result will follow as soon as we prove that: 

liminf-2x- 2 logP({X' N £ C t>j } n {M > x}/X(t) = x,X'(t) = 0) > - * (69) 

for each j = 0, 1, . . . , do, where the liminf has some uniformity in t. 
Let us write the Gaussian regression of X(s) on the pair (X(t), X' (i)) 

X(s) = a\s)X(t) + (b\a),X'(t)) + R\s). 

Since X{t) and X'{t) are independent, one easily computes : 

a\s) = r(s,t) 
b t (s)=At 1 r 01 (s,t). 

Hence, conditionally on X(t) = x, X'-{t) = 0, the events 

{M > x} and {R\s) > (1 - r(s,t))x - rQ 1 (s,t)A^ 1 X' jN (t) for some s £ S} 
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coincide. 

Denote by (Xj )iV (i)|Xj(i) = 0) the regression of X' jN (t) on Xft) = 0. So, the probability in 
(1691) can written as 



nc\s) > x - r oi(M)A, v for gome s e 5fe it)=o{x>)dx , (70) 



L , v ~, . ~ l-r(s,t) " ^ " }1 - a In 



where 

RHs) 



l-r(M) 

• dx' is the Lebesgue measure on Ntj. Remember that Ctj C Nt 
If — A < T 1 roi(s, i) G Ctj one has 

-r^M^V > 

for every x' G C^j, because of the definition of Ctj- 

If — A i T 1 roi(s, t) ^ Ctj, since Ctj is a closed convex cone, we can write 

-AT/ roi(a,i) = z' + z" 

with z' G Ctj , z'-Lz" and ||z"|| = dist(— A < T 1 roi(s, t),Ctj). 
So, if x' G Ctj : 

-r^ 1 (s,t)A < T 1 x / _ z /T x' + z //7 y > ^ , 
1 — r(s, t) 1 — r(s, t) ~ 

using that z' T x' > and the Cauchy-Schwarz inequality. It follows that in any case, if x' G Ctj 



the expression in (170p is bounded by 

/ P(C*(s) > x- K t \\x'\\ for some s G S^px^MIXj^oOc')^- ( 71 ) 

To obtain a bound for the probability in the integrand of (|71|) we will use the classical 
inequality for the tail of the distribution of the supremum of a Gaussian process with bounded 
paths. 

The Gaussian process (s,i)) ~^ C'( s )> defined on (S x S)\{s = t} has continuous paths. As 
the pair (s,t) approches the diagonal of S x S, C*( s ) may not have a limit but, almost surely, 
it is bounded (see [8] for a proof). (For fixed t, £*(.) is a "helix process" with a singularity at 
a = t, a class of processes that we have already met above). 

We set 

• m*(a) :=E(C*(s)) (a ^ t) 

• m := sup Sjte5jS ^ |m*(a)| 

• /i := E(| sup aiteS>a#t [C*(a) - m*(a)] |) . 

The almost sure boundedness of the paths of C*( s ) implies that m < oo and // < oo. Applying the 
Borell-Sudakov-Tsirelson type inequality (see for example Adler [2] and references therein) to 
the centered process s ~» C t (s)—m t (s) defined on 5'\{t} , we get whenever x — K t \\x'\\—m— fi > 0: 

P{C*(s) > x — «t[|a?'[| for some s G 5} 

< P{C*(s) — m*(s) > x — «t[|a/|| — m for some a G 5} 

. (x-K f ||x'|| - m-jif , 
<2exp( ^ ). 
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The Gaussian density in the integrand of (|TTj) is bounded by 



(27rXj(t))— exp 



^ \\x'-m>^ 2 



2XAt) 



where Xj (t ) and Xj (t) are respectively the minimum and maximum eigenvalue of Var {X 1 - N {t)\ X'- (t) ) 
and m'- N (t) is the conditional expectation EpTj N (t)\Xj(t) = 0). Notice that Xj(t), Xj(t),m'j N (t) 
are bounded, Xj(t) is bounded below by a positive constant and Xj(t) < A(i). 

Replacing into ([7T]) we have the bound : 



P{{x' j:N e c tjj } n {m > x}/x(t) = = o) 

< (27rA i (t)) ^ 2 / exp-( — - 2 + -=^- ) dx 

+ p(\\x^ N (t)\x' j (t) = o\\> x ~™~ l * )> ( ?2 ) 

where it is understood that the second term in the right-hand side vanishes if Kt = 0. 
Let us consider the first term in the right-hand side of (|72p . We have: 

(x - K t \\x'\\ - m - v) 2 \W - m' jN 



2af 2X(t) 



(x-KtWx'W-m-fi) 2 (\\x'\\ - \\m' jN {t)\\f 



M 2X(t) 

l2 (x — m — a — Kt\\m': N {t)\\Y 
[A{t)\\x'\\ + B(t)(x - m - fi) + C(t)] 2 + — 



2a\ + 2X(t)n 



where the last inequality is obtained after some algebra, A(t) , B (t) , C (t) are bounded functions 
and A(t) is bounded below by some positive constant. 



So the first term in the right-hand side of (|72p is bounded by : 

_ , n , ,tzA Ax - m - - Kt\\m' jN {t)f 

2.(27rA 1 ) 2 exp- =— -£ 

y ~ jJ K 2a 2 + 2X{t) K 2 

[ exv-[(A(t)\\x'\\ +B(t){x-m-n) + C{t))] 2 dx' 

. . - f (x-m- fi- Kt \\m' jN (t)\\) 2 ^ 

< L \x\ J exp— = J - (73) 

" ' V 2a 2 + 2A(t)K 2 ) V ' 

where L is some constant. The last inequality follows easily using polar coordinates. 

Consider now the second term in the right-hand side of (|72p . Using the form of the conditional 
density p X '. N (t)/x'-(t)=oi x ')> ^ follows that it is bounded by 

x — m — u — Kt\\m', M (t)\\ ^ 
V{\\{X'^ N (£)/X' 3 {t) = 0) - m' hN {t)\\ > * 



<L 1 |x|^- 2 exp-( ^ - ''I ) (74) 



(x-m- n- Kt\\m' j)N (t)\\) 



2 



2 



2A(^ 2 

where L\ is some constant. Putting together (JT3j) and ([731) with ([72]) . we obtain ([69]) . □ 
The following two corollaries are straightforward consequences of Theorem 
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Corollary 3 Under the hypotheses of Theorem^ one has 

liminf — 2x~ 2 log \p E (x) — Pm(x)\ > 1 + inf — — = -. 

x^+oo teS <jf + \{t)Kl 

Corollary 4 Let X a stochastic process on S satisfying Al -A5. Suppose in addition that 

E{X(t)) = 0, E(X 2 (t)) = 1, Var(X'(t) = I d for all t e S. 

Then 

r+oo 

P(M > u) - \ p E {x)dx 



lim inf — 2u 2 log 

u— >+oo 



> 1 + inf ' 



teS a\ + k 2 ' 



and 

do 



P E ( X ) = [j2i-iy(2^ 2 9 j H j (x)] l p(x). 

j=0 

where gj is given by 123\) and Hj(x) has been defined in Section^ 

The proof follows directly from Theorem [6] the definition of p E (x) and the results in pQ. 

6 Examples 

1) A simple application of Theorem [5] is the following. Let X be a one parameter real- valued 
centered Gaussian process with regular paths, defined on the interval [0, T] and satisfying an 
adequate non-degeneracy condition. Assume that the variance v(t) has a unique maximum, say 
1 at the interior point to, and k = min{j : v^\to) ^ 0} < oo. Notice that v^ 2k \to) < 0. Then, 
one can obtain the equivalent of pm {%) as x — > oo which is given by: 

Pm(x) * l ~ V " { ^' 2 V (l^- 1 ) x^(x), (75) 
kC k 

where ^ is a standard normal random variable and = — j^y^ (to) + ^[^"(^o)] 2 lfc=2- The 
proof is a direct application of the Laplace method. The result is new for the density of the 
maximum, but if we integrate the density from u to +oo, the corresponding bound for P{M > u} 
is known under weaker hypotheses (Piterbarg |28j). 

2) Let the process X be centered and satisfy A1-A5. Assume that the the law of the process 
is isotropic and stationary, so that the covariance has the form (|1U|) and verifies the regularity 
condition of Section SJ We add the simple normalization p' = p'(0) = —1/2. One can easily 
check that 

l-p 2 (\\s-t\\ 2 )-Ap' 2 (\\s-t\\ 2 )\\s-t\\ 2 

a t = sup h 7\\ 7ii2vi2 v 76 ) 

s&s\{t} ft - p(\\s - t\\ 2 )} 2 

Furthermore if 

p'(x) < for x > (77) 

one can show that the sup in ([76]) is attained as \\s — t\\ — > and is independent of t. Its value 
is 

a 2 = Up" - 1. 
The proof is elementary (see [4] or [34]). 

Let S be a convex set. For t G Sj, s £ S: 

dist(-r 0l {s,t),Ct,j) =dist(-2p'(\\s-t\\ 2 ){t-s),C t , j ). (78) 
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The convexity of S implies that (t — s) S C t j. Since C t j is a convex cone and — 2p'([|s — t\\ 2 ) > 0, 
one can conclude that —roi(s,t) 6 Ctj so that the distance in ([78]) is equal to zero. Hence, 

Kt = for every t £ S 

and an application of Theorem [6] gives the inequality 

2 1 

ShSE " ^ lo S b(^) " (s)J > 1 + , _ 1 • (79) 

A direct consequence is that the same inequality holds true when replacing p(x) — Pm{x) by 
\p E (x) — pm{x)\ in (|79p . thus obtainig the main explicit example in Adler and Taylor [3J, or in 
Taylor et al. [55] . 

Next, we improve (j79p . In fact, under the same hypotheses, we prove that the liminf is an 
ordinary limit and the sign > is an equality sign. We state this as 

Theorem 7 Assume that X is centered, satisfies hypotheses A1-A5, the covariance has the 
form < f77J|) with p'(0) = —1/2, p'{x) < for x > 0. Let S be a convex set, and do = d > 1. 
Then 

Remark Notice that since S is convex, the added hypothesis that the maximum dimension do 
such that Sj is not empty is equal to d is not an actual restriction. 

Proof of Theorem [7J 

In view of ([79]) . it suffices to prove that 

2 1 
limsup- — log [p(x) -p M {x)\ < 1 + — ^ -. (81) 

x^+oo 2^ IZp — 1 

Using Q and the definition of p(x) given by ([8]) , one has the inequality 

p(x) - PM (x) > (2vr)- d /V(x) f E(| det(X"(t))| 1 M>X /X(t) = x,X'(t) = 0)a d (dt), (82) 

Js d 

where our lower bound only contains the term corresponding to the largest dimension and we 
have already replaced the density Px(t),X'(t)( x , 0) by its explicit expression using the law of the 
process. Under the condition {X(t) = x,X'(t) = 0} if v T X"(t)v > for some v £ S^ 1 , a 
Taylor expansion implies that M > x. It follows that 

E(| det(X"(t))\ 1 M>X /X(t) = x,X'(t) = 0) 

> E(| det(X"(t))| i gup v T x „ {t)v > Q /X(t) = x,X'(t) = 0). (83) 



ves 



d-l 



We now apply Lemma [2] which describes the conditional distribution of X"(t) given X(t) = 
x,X'(t) = . Using the notations of this lemma, we may write the right-hand side of (j83[) as : 



Vl v n sup v Zv > x> 1 



which is obviously bounded below by 



E(\det(Z-xId)\ l Zll>x ) 

r+oo 2 
= J E(\det(Z -xId)\/Z n =y)(2Tr)- 1/2 o-~ 1 ex V (-^)dy, (84) 
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where a 2 := Var(Zn) = 12p" — 1. The conditional distribution of Z given Z\\ = y is easily 
deduced from Lemma [2l It can be represented by the random d x d real symmetric matrix 

( V Z12 Zi d \ 

£2 + ay Z22, ••• Z 2 d 



Z 



\ 



id + ay j 



where the random variables {£2, • • • > £,d, Zik, I < i < k < d} are independent centered Gaussian 
with 

Var(Z jfc )=4p" (1 < i < k < d) ; Varfe) = Wf/ '} 8 f ~ ^ (i = 2,...,d) ; a J/> " ~ ' 



12/9" -1 v ' ' ' 1 Up" -I 

Observe that < a < 1. 

Choose now ao such that (l + ao)a < 1. The expansion of det(Z — xld) shows that if x(l+ao) < 
y < x(l + ao) + 1 and x is large enough, then 

E(| det(Z - xld)\) > L a (l - a(l + a )) d_1 x d , 



where L is some positive constant. This implies that 



exp(-^)E(| det(Z-a:/d)|)dy > 



, . - /••••/ - /-r- / exp(-^ I )a (l-a(l+a ))' i 1 x d dy 

'2-KO J x 2a 2 V 27T<7 A(l+a ) 2| 7 

for x large enough. On account of (j82"j) .(|83" ]) .(|84 p . we conclude that for x large enough, 



— pm(x) > L\x d exp - 



x 2 (x(l + a ) + l) 



y + 



2a 2 



for some new positive constant L\. Since «o can be chosen arbitrarily small, this implies (|8ip . 

□ 

3) Consider the same processes of Example 2, but now defined on the non-convex set {a < 
\\t\\ < b}, < a < b. The same calculations as above show that Kt = 0ifa<||t||<6 and 



Kt = max - 



sup 



-2p'(z 2 )z 



, sup 



2ap'(2a 2 (l - cos0))(l - cosO) 



£[2a,a+6] 1 - Pl^J e G [o,7r] 1 - p(2a 2 (1 - COS 6>) ) 



for lltll = a. 



4) Let us keep the same hypotheses as in Example 2 but without assuming that the covari- 
ance is decreasing as in (J77J). The variance is still given by (f76l) but n t is not necessarily equal 
to zero. More precisely, relation (I78p shows that 



p'f I s — t 2 )+ \\s-t\ 
«t< sup 2^ ; 1 1 



The normalization: p' = —1/2 implies that the process X is "identity speed", that is 
Var(X'(i)) = 1^ so that X(t) = 1. An application of Theorem [6] gives 



liminf » log [pfx) - Pm(^)1 > 1 + 

x— >+oo X 



(85) 



where 



U 2 ) + ^l 



l-p 2 (z 2 )- W);z 2 4[g^ 

Za : " , e s ( u o P A] F*f + SI] FW 



and A is the diameter of S 1 . 



5) Suppose that 
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the process X is stationary with covariance T(t) := Cov(X(s), X(s + t)) that satisfies 
r(si, . . . ,Sd) = Ili=i d ^*( s «) where T%, are d covariance functions on R which are 
monotone, positive on [0, +00) and of class C , 



S is a rectangle 



i=l,...,d 



Then, adding an appropriate non-degeneracy condition, conditions A2-A5 are fulfilled and The- 
orem [6] applies 

It is easy to see that 



-ro,i(s,t) 



r'i(si - h)T 2 (s 2 -t 2 ) ... T d (s d - t d ) 



-h)... r d _i(s d _i - t d _i).r' d (s d - t d ) 



belongs to Ctj for every s E S. As a consequence = for all t £ S. On the other hand, 
standard regressions formulae show that 



Var(A( S )/A(t),A'(t)) _ I - Tj . . .T d - TfTl . .T d 



1 1 • • • 1 cZ-l 1 d 



(l-r( a ,t))5 



;i-r!...r d )2 



where Tj stands for r^Sj — tj). Computation and maximisation of o\ should be performed 
numerically in each particular case. 
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